Update the dockerfile base image to cuda-dl-base #1248
So I run a check in the
For the symlink, we'd just need this part of the @DwarKapex does it sound right to you?
LGTM, only one minor nit left.
@yhtang and/or @chaserileyroberts to review the GCP networking relevant parts.
What are the symlink-xyz scripts modeled after? How do other DLFW containers accommodate the DL core container?
@yhtang
Bumping the CUDA base image is usually not a light job, as it may break things. Hence why we always update it via a PR. The DL base image is only updated once a month, so IMHO we can live with it.
- Remove some infrastructure missed in #1296
- Fix metric calculation/check for the remaining MaxText tests
- Remove the MJX pipeline added in #497, which had been failing for months.
- Update the README à la #1143 and #1198 to include dates of the first nightlies to include the base container bumps of #1248, #1276 and #1320
- Add a missing test dependency for Levanter unit tests
- Remove some more T5X tests, leaving only a ViT one, and try to fix its metric calculation/check
The prior setup pre-dated #1248; now things can be simpler.

Co-authored-by: Steboss <[email protected]>
Update the base Docker image so that we can use cuda-dl-base from nvcri.
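
As a rough illustration of what such a base image bump can look like, here is a minimal sketch, not the actual diff from this PR. The `BASE_IMAGE` build argument is a hypothetical name, and the concrete cuda-dl-base reference (registry path and tag) is deliberately left to the caller because it is not stated in this conversation.

```dockerfile
# Hypothetical sketch: the base image is passed in at build time, so a
# bump to a newer cuda-dl-base release is a one-line change in the build
# invocation (or in CI) and can go through a PR.
# BASE_IMAGE is an assumed argument name, not taken from the repository.
ARG BASE_IMAGE

# An ARG declared before FROM can be referenced in the FROM line.
FROM ${BASE_IMAGE}

# Record which base the image was built from. The ARG has to be
# redeclared after FROM to be visible inside the build stage.
ARG BASE_IMAGE
LABEL base_image="${BASE_IMAGE}"
```

Under these assumptions the image would be built with `docker build --build-arg BASE_IMAGE=<cuda-dl-base image reference> .`, where the placeholder is replaced by the full reference of the monthly cuda-dl-base release being adopted.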